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Abstract 

Social trust prediction addresses the significant 
problem of exploring interactions among users in 
social networks. Naturally, this problem can be for¬ 
mulated in the matrix completion framework, with 
each entry indicating the trustness or distrustness. 
However, there are two challenges for the social 
trust problem; 1) the observed data are with sign 
(1-bit) measurements; 2) they are typically sampled 
non-uniformly. Most of the previous matrix com¬ 
pletion methods do not well handle the two issues. 
Motivated by the recent progress of max-norm, we 
propose to solve the problem with a 1-bit max- 
norm constrained formulation. Since max-norm is 
not easy to optimize, we utilize a reformulation of 
max-norm which facilitates an efficient projected 
gradient decent algorithm. We demonstrate the 
superiority of our formulation on two benchmark 
datasets. 


1 Introduction 

With the increasing popularity of social networks, there ex¬ 
ist many interesting and difficult problems, such as friends 
recommendation, information propagation, etc. In this paper, 
we study the problem of social trust prediction, which aims 
to estimate the positive or negative relationship among the 
users based on the existing trustness information associated 
with them. This problem plays an important role in social 
networks as the system can block the invitation from some¬ 
one that the user does not trust, or recommend new friends 
who enjoys a high reputation. 

Naturally, the social trust problem can 
be formulated within the matrix completion 
framework _ ||Liben-Nowell and Kleinberg, 2007) 

iBillsus and Pazzani, 1998| ! That is, the (z,j)-th entry 

of the observed data matrix Z G is a 1-bit code 

implying that the z-th user trusts the j-th user if Zij — 1. 
Here, n denotes the number of users. However, what we 
observe is only a small fraction of the entries, whose values 
are zero. And our goal is to estimate the missing entries 
according to the 1-bit measurements in Z. 

Note that the problem is ill-posed if no assumption is im¬ 
posed on the structure of the data. To solve the problem. 


a number of methods are proposed. Generally, existing so¬ 
cial trust prediction methods fall into three categories. The 
first category is based on similarity measures or the struc¬ 
tural context similarity IINewman, 200 1[ |Chowdhury, 2010[ 
I IKatz, 195^ |Jeh and Widom, 2002| , rnotivated by the in¬ 
tuition that an individual tends to trust their neighbors, or 
the ones with similar trusted people. The second is based 
on low rank matr ix completion |Billsus and Pazzani, 19981 
yCai et al., 2010| |Huang et a/., 2013) , which assumes that 
the underlying data matrix is low-rank or can be approximate 
by a low-rank matrix. The third one models the problem as 
a binary classification one and utilizes techniques such as lo¬ 
gistic regression | |Leskovec et al, 2010| . 

Challenges. However, there are two issues emerging 
in social trust prediction which are not well characterized 
by the algorithms in previous works. First, the value of 
the observed entry is either 1 or —1, which is analogous 
to the binary classification problem. But in our problem, 
we are handling much more complex matrix data. For¬ 
tunately, ySrebro et al, 2004) presented a maximum margin 
matrix factorization framework that unifies the binary prob¬ 
lem for vector case and matrix case. The key idea in their 
work is a low-norm matrix factorization, which will also be 
utilized in this paper. Second, the locations of the entries are 
sampled non-uniformly, which gaps the theory and practice 
for a lot of matrix completion algorithms. To tackle this chal¬ 
lenge, we suggest using the max-norm as a convex surrogate 
for the rank function, which is shown to be superior to the 
well-known nuclear norm when addressing the non-uniform 
data ySalakhutdinov and Srebro, 20101 . 

Our contributions are two-folds: 1) To the best of our 
knowledge, we are the first to address the social trust predic¬ 
tion problem by utilizing a max-norm constrained formula¬ 
tion. 2) Although a max-norm constrained problem can be 
solved by SDP solvers and an accurate enough solution can 
be achieved, we here utilize a projected gradient algorithm 
that is scalable to large scale datasets. We empirically show 
the improvement of our formulation for the non-uniform 1 -bit 
benchmarks compared to state-of-the-art solvers. 


2 Related Work 

Social interaction is investigated intensively in the last 
decades. The social interaction indicates the friendship, sup¬ 
port, enemy or disapproval as shown in Figure[T] Online users 
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Figure 1; Illustration of the data matrix of social trust. Each 

column of the data matrix is a 

rating sample. 

Each entry is 


a tag that a user assigns to another user. The symbol “+” de¬ 
notes the “trust” relationship, denotes “distrust” and “?” 
means unknown relationship (that we aim to predict). Most 
of the relationships are unknown, so the data is sparse. Typi¬ 
cally, each individual has own preference and friendship net¬ 
work, making the data non-uniform. Also noted that the data 
matrix may not be symmetric. 


rely on the trustworthiness information to filter information, 
establish collaboration or build social bonds. Social networks 
rely on the trust information to make recommendation, at¬ 
tract users from other circles, or influencing public opinions. 
Thus, the exploration of social trust has a wide range of ap¬ 
plications, and has emerged as an important topic in social 
network research. A number of methods are proposed. 

One kind of methods are based on the similar¬ 
ity measurement. Specifically, Jaccard’s coefficient 
is commonly used to measure the probability two 
items that have a relationship. Inspired by the met¬ 
ric, Jeh and Widom |Jeh and Widom, 2002| proposed a 
domain-independent similarity measurement, SimRank. 
I|N ewman, 20011 directly dehned a score to verify the corre¬ 
lation between common neighbors. Some methods are based 
on relational data modeling, structural proximity measures 


and stochastic relational model 

Getoor and Diehl, 2005) 

(Liben-Nowell and Kleinberg, 2007 

OYuefa/.,2009|. The 


above mentioned methods are mainly derived from the 
solutions of link prediction. The link prediction is ori¬ 
ented to network-level prediction, whereas social trust 
prediction focuses on person-level. Another class of 
methods are derived from the collaborative filtering meth¬ 
ods, such as clustering tec hniques IjSarwar et al, 2001 , 
model-based methods (Hofmann a nd Puzi cha, 1999 , 
and the matrix factorization models (Srebro ef a/., 2003| 
(Mnih and Salakhutdinov, 2007| . However, the data matrix 
of trust has some structure properties different from the 


user-item matrix, such as transitivity. Meanwhile, the social 
trust in reality is extremely sparse. For instance, Facebook 
has hundreds of millions of users, but most of them have less 
than 1,000 friends. Besides, the people with similar person¬ 
ality tend to behave similarly. To sum up, the data matrix of 
social trust has both sparse and low-rank structure. Thus, the 
social trust prediction problem is especially suitable for the 
matrix completion model. That is the focus of our paper. 

The problem of matrix completion is to recover a low-rank 
matrix from a subset of entries | |Candes and Recht, 2009| , 
which is given by; 

min rank (AT) 

^ ( 2 . 1 ) 
s.t. Pn{X) = Pn{Z) 


where Z S is the data matrix, X is the recovered 

matrix, and H is the index set of observed entries. Th op¬ 
timization problem ( 12. Il l is not only NP-hard, but requires 
double exponential time complexity with the number of sam¬ 
ples | |Recht et al, 2010) . To solve the above problem, one 
alternative is to use nuclear norm as a relaxation to the rank 
function; 


rmn ||X||* 

s.t. Pa{X)=Pn{M) 


( 2 . 2 ) 


where ||X||* denotes the sum of singular values of matrix 
X. yCai et al, 2010| developed a first-order procedure 
to solve the convex problem ( 12.21) . namely singular 
value thresholding (SVT). | |Jain et al, 2010| minimized 
the rank minimization by the singular value projection 
(SVP) algorithm. | |Keshavan et al., 2010| solved the 
problem by first trimming each row and column with 
too few entries, then compute the truncated SVD of the 
trimmed matrix. Under certain conditions, it showed 
accurate recovery on the order of ndlogn samples (n is 
the number of samples, d is the rank of recovered ma¬ 
trix). With the rapid development of matrix completion 
problem, some more efficient methods have been proposed 


yCandes and Tao, 2010| | |Gross, 201 l)|Wang and Xu, 2012| |Huang et al., 2 

However, all the methods mentioned above use the 
nuclear norm as the surrogate to the rank, whose ex¬ 
act recovery can be guaranteed only when the data 
are sampled uniformly, which is not practical in real 
world applications. On the other hand, recent empiri¬ 
cal on max-norm ySrebro et al., 2004) shows promising re¬ 
sults for non-uniform data if one utilize the max-norm 
as a surrogate (Salakhutdinov and Srebro, 2010j . Notably, 
for some specific problems, such as collaborative hlter- 
ing, (Srebro and Shraibman, 2005] proved that the general¬ 
ization error bound for max-norm is better than the nuclear 
norm. More recently, yShen et al., 2014| reported encourag¬ 
ing results on the subspace recovery task (which is closely 
relevant to matrix completion). Since the social trust data is 
non-uniformly sampled, we believe that a max-norm regu¬ 
larized formulation can better handle the challenge than the 
nuclear norm. Our formulation is also inspired by a re¬ 
cent theoretical study on matrix completion with 1-bit mea¬ 
surement yCai and Zhou, 2013), which established a minimax 













































lower bound on the general sampling model and derived the 
optimal convergence rate in terms of Frobenius norm loss. 
Furthermore, there are several practical algorithms to solve 
max-norm regularized or max-norm constrained problems, 
see | |Lee et al, 2010| and | |Shen et al, 2014| for example. 

2.1 Overview 

After review of related work in Section ^ we introduce the 
notations and formulate the problem in Section |3] Then 
we give algorithm to solve the max-norm constrained 1-bit 
matrix completion (MMC) problem in Section ID Mean¬ 
while, we also provide an equivalent SDP formulation for the 
MMC, which can be accurately solved at the expense of effi¬ 
ciency. Then we report the empirical study on two benchmark 
datasets in Section|2 Section |6]concludes this paper and dis¬ 
cusses possible future work. 


3 Notations and Problem Setup 

In this section, we introduce the notations that will be used in 
this paper. Capital letters such as M are used for matrices and 
lowercase bold letters such as v denotes vectors. The i-th row 
and j-th column of a matrix M is denoted by m(i) and rrij 
respectively, and the (i, j)-th entry is denoted by Mij. For a 
vector V, we use Vi to denote its i-th element. We denote the 
norm of a vector v by ||v|| 2 . For a matrix M G Rf we 
denote the Frobenius norm by ||M||^ and ||AT ||2 denotes 
the maximum £2 row norm of M, i.e., 

’ t—1 

We further define the max-norm of M | |Linial et ah, 2007] , 
ll^llmax = jjy^%yT ^ (3-1) 


where we enumerate all possible factorizations to obtain the 
minimum. 

Intuition on max-norm. At a first sight, the max-norm is 
hard to understand. We simply explain why it is a tighter 
approximation to the rank function than the nuclear norm. 
Again, we write the nuclear norm of M as a factorization 
form I IRecht et al, 2010] : 


\\M\l 


1 

mm — 

U,V,M=UV"' 2 




Note that the Frobenius norm is the sum of the square of the 
£2 row norm. Thus, a nuclear norm regularizer actually con¬ 
strains the average of the £2 row norm, while the max-norm 
constrains the maximum of the £2 row norm! 

Given the observed data Z G we are interested in 

approximating Z with a low-rank matrix X, which can be 
formulated by, 

minimize ^ \\Vn{Z - X)\\p , 
s.t. rank(X) < d, 


otherwise. However, it is usually intractable to optimize the 
above program as the rank function is non-convex and non- 
continuous | |Recht et al, 20101 . One common approach is to 
use the nuclear norm as a convex surrogate to the rank func¬ 
tion. However, it is well known that the nuclear norm can¬ 
not well handle the non-uniform data. Motivated by the re¬ 
cent progress in max-norm ISalakhutdinov and Srebro, 20101 
|Cai and Zhou, 201 3| [Shm et a/., 2014r we use the max- 
norm as an alternative convex relaxation, which gives the fol¬ 
lowing formulation: 

s-t- II^ILax < 

where A is some tunable parameter. 


4 Algorithm 

The max-norm is convex and moreover, it can be solved by 
any SDP solver. Formally, we have the following lemma: 

Lemma 4.1 (| |Srebro et al, 2004) ). For any matrix X G 
RP^" and A G R, HAfUmax — if and only if there ex¬ 
ist A G RP^P and B G R"^", such that [^t ^ ] is semi- 
definite positive and each diagonal element in A and B is 
upper bounded by A. 

With Lemma l4n on hand, one can formulate Problem 13.21 
as an SDP: 


min 

x,A,B 


^\\Vn{Z-X)\\l, 


s.t. An < A^, Bjj < A^, 


■ A 

X^ 


X 

B 


t 0. 


Vi G [p], j G [n], 


(4.1) 


And this program can be solved by any SDP solver to obtain 
accurate enough solution. 

However, SDP solvers are not scalable to large matrices. 
Thus, in this paper, we apply a projected gradient method to 
solve Problem (13.2b . which is due to |Lee et al., 2010[. Akey 
technique is the reformulation of the max-norm (13.11 1. As¬ 
sume that the rank of the optimal solution X* produced by 
the SDP (14.1b is at most d. Then we can safely factorize 
X = , with U G RP^^^ and V G Combining 

the factorization and the definition, we obtain the following 
equivalent program: 

s.t. ||C/||2,.o < A, ||F||2,^ < A. 

Note that the gradient of the objective function w.r.t. 

V can be easily computed. That is, 

Vuf{Z,U,V) =Vn{{UV^ - Z)V) , 

VyfiZ, U, V) = Vn {{VU^ - Z^)U) . 


(4.2) 
U and 

(4.3) 


where H is an index set of observed entries and d is some 
expected rank. Vn{M) is a projection operator on a ma¬ 
trix M such that Pn{mij) = nriij if {i,j) G H and zero 


Here, for simplicity we define 

f{Z,U,V) = ^\\rn{Z-UV^)\\l 
























Algorithm 1 Max-norm Constrained 1-Bit Matrix Comple¬ 
tion (MMC) 

Input: Z G (observed samples), parameters A, initial 
solution (C/q) Vq), maximum iteration r. 

Output: Optimal solution (C/*, C*). 

1 : for t = 1 to r do 

2: Compute the gradient by Eq. (I4.31 i: 

V; = Vyf{Z,Ut-uV) \v=v,., ■ 

3: Compute the step size at according to Armijo rule. 

4: Compute the new iterate: 

Ut = - atU^), 

= U{Vt-i - atVl). 

5: end for 


The inequality constraints can be addressed by a projection 
step. That is, when we have a new iterate {Ut, Vt) at the t- 
th iteration, we can check if they violate the constraints. If 
not, we can proceed to the next iteration. Otherwise, we can 
scale the rows of U and/or V by — and/or -prip— re¬ 
spectively. In this way, we have the projection operator: 


n(M) 



if 11^112.00 > A, 

Otherwise. 


(4.4) 


If we further pick the step size at via the Armijo 
rule I Armijo and others, 1966[ , it can be shown that 
the sequence of {Ut,Vt) will converge to a stationary 
point I IBertsekas, 1999| . The algorithm is summarized in Al- 

gorithmin 


The benefits of applying the factorization on the max-norm 
are two-folds: 1) the memory cost can be significantly re¬ 
duced from 0{pn) of SDP to 0{d{p -f n)). 2) it facil¬ 
itates the projected gradient algorithm, which is computa¬ 
tionally efficient when working on large matrices (see Sec- 
tionl^l. However, note that Problem ( I4.21 l is non-convex. For¬ 
tunately, |Burer and Monteiro, 2005[ proved that as long as 
we pick a sufficiently large value for d, then any local mini¬ 
mum of Eq. (I4.21 i is a global optimum. In Section we will 
report the influence of d on the performance. Actually, in 
Algorithm[Tl the stopping criteria is set to be a maximum it¬ 
eration. One may also check if it reaches a local minima as 
the stopping criteria, as discussed in ||Cai and Zhou, 2013). 


4.1 Heuristic on A 

The A is the only tunable parameter in our algorithm. For 
our problem, note that the data is of 1-bit measurements, i.e., 
\Zij\ = 1 for {i,j) G fi. Also note that Ztj = u(i)v(j)^. 
Thus, |u(i)v(j)"''| = 1. So we have A > 1. However, if we 
choose a large A, the estimation \Xij \ may deviate away from 
1. We find that A = 1.2 lead to satisfactory improvement. 


5 Experiments 

In this section, we empirically evaluate our method for the 
matrix completion performance. We will first introduce the 
used datasets. In the experimental settings, we present the 
comparative methods and evaluation metrics. Then we report 
encouraging results on two benchmark datasets. We also ex¬ 
amine the influence of matrix rank d. 


5.1 Datasets 

We conduct the experiments on two benchmark datasets: 
Epinions and Slashdot. In these two datasets, the users are 
connected by explicit positive (trust) or negative (distrust) 
links (i.e., the 1-bit measurements in Z). The first dataset con¬ 
tains 119,217 nodes (users) and 841,000 edges (links), 85.0% 
of which are positive. The Slashdot dataset contains 82,144 
users and 549,202 links, and 77.4% of the edges are labeled 
as positive. Table [T] gives a summary description about the 
subset used in our experiment. 

It is clear that the distribution of links are not uniform 
since each user has his/her individual preference and own 


friendship network. Following I Huang et al., 2013) , we se¬ 
lect 2,000 users with the highest degrees from each dataset to 
form the observation matrix Z. 


Table 1: Description of 2 datasets 


Dataset 

Epinions 

Slashdot 

#of Users 

2,000 

2,000 

#of Trust 

171,731 

68,932 

#of Distrust 

18,916 

20,032 


5.2 Experimental Settings 

Baselines. We choose four state-of-the-art methods 
as baselines, including SVP | Jain ef a/., 2010|, SVT 


| |Cai et al., 2010), OPTSpa ce | |Keshavan et al., 2010) and 
RRMC jHuang et a/., 201 3] . Since SVT and RRMC need 
a specified rank, we tune the rank for these methods and 
choose the best performance as the final result. 

Evaluation Metric. Let T be the index set of all observed 
entries. We use two evaluation metrics to measure the per¬ 
formance, mean average error (MAE) and root mean square 
etTor (RMSE), computed as follows: 

MAE= Y. {X,o-M,,)/{\T\-P\), 

{z,j)&T\n 


RMSE= Y ix,j-M,,)y{\T\-\n\), 
V {i,j)€T\Q 


where \T\ denotes the cardinality of T. 

Training and Testing. We randomly split the dataset for 
training and testing. In particular, the number of observation 
measurements H for training ranges from 10% to 60%, with 
step size 10%. For each split, we run all the algorithms for 
20 trials, with the training data in each trail being randomly 
sampled. Then, we report the mean and standard deviation of 
MAE and RMSE over all 20 trials. 


































Table 2: MAE Results on Epinions Dataset 


Observed entries (%) 

Methods 

SVT 

OPTSpace 

SVP 

RRMC 

MMC 

10 

0.359±0.004 

0.289±0.019 

0.450±0.008 

0.576±0.001 

0.254±0.003 

20 

0.394±0.022 

0.236±0.005 

0.294±0.002 

0.518±0.002 

0.212±0.003 

30 

0.360±0.057 

0.219±0.009 

0.248±0.001 

0.460±0.002 

0.201 ±0.002 

40 

0.410±0.099 

0.205±0.008 

0.224±0.001 

0.418±0.002 

0.193±0.001 

50 

0.471 ±0.129 

0.197 ±0.007 

0.210±0.001 

0.386±0.001 

0.190±0.002 

60 

0.476±0.146 

0.197±0.003 

0.199±0.001 

0.362±0.002 

0.206±0.003 


Table 3: RMSE Results on Epinions Dataset 


Observed entries (%) 

Methods 

SVT 

OPTSpace 

SVP 

RRMC 

MMC 

10 

0.513±0.010 

0.530±0.021 

0.610±0.010 

0.650±0.001 

0.466±0.004 

20 

0.559±0.031 

0.456±0.005 

0.459±0.002 

0.606±0.002 

0.406±0.004 

30 

0.532±0.082 

0.422±0.011 

0.415±0.002 

0.563±0.002 

0.383±0.002 

40 

0.620±0.171 

0.406±0.015 

0.394±0.002 

0.533±0.002 

0.371 ±0.002 

50 

0.719±0.225 

0.398±0.016 

0.381±0.001 

0.509±0.001 

0.364±0.001 

60 

0.728±0.288 

0.403 ±0.009 

0.371 ±0.002 

0.491 ±0.002 

0.365±0.002 


5.3 Experimental Results 

We report detailed results from Table |2] to Table |5] Erom 
the results in Tables |2] and [3 we observe that MMC out¬ 
performs the other methods in terms of both evaluation met¬ 
rics on the Epinions dataset most of the time. In particular, 
when there are few observations 10 % (which indicates a hard 
task), MMC obtains the RMSE of 0.466, much better than 
OPTSpace (0.530), SVP (0.610) and RRMC (0.650). Except 
on the case with 60% observed entries, OPTSpace obtains the 
smallest MSE with 0.197, but our algorithm is comparative 
with 0.206. In a nut of shell, the gap between MMC and the 
baselines becomes larger as the fraction of observed entries 
decreases. 

Similarly, our method achieves the least MAE and RMSE 
on the Slashdot dataset (see Table |4] and |5]l. Eor instance with 
30 % observed entries, MMC obtains the MAE with less than 
0.4, much better than the comparative methods, such as SVT 
(0.513), OPTSpace (0.427), SVP (0.501) and RRMC (0.686). 
In terms of RMSE, in the case of 20% observed entries, the 
RMSE values of other methods are all above 0.7 while our 
method reaches 0.679. In sum, our method is superior than 
the comparative methods on two real-life datasets in terms of 
MSE and RMSE most of the time. 

Since we have studied the effectiveness of our method, here 
we examine the computational efficiency in Table |6l which is 
important for practical applications. To test the time com¬ 
plexity of the methods, we report the averaged time cost on 
the Epinions dataset with 10% observed entries and Slash- 
dot with 20% observed entries. To illustrate the trade-off be¬ 
tween accuracy and efficiency, we also report the MAE and 
RMSE. As we see, SVP is the most efficient method, whose 
running time is 0.84 seconds on Epinions while ours is 1.92 
seconds. On Slashdot, it also achieves the best performance 
in terms of efficiency. However, our method enjoys a signifi¬ 
cant improvement of MAE and RMSE compared to all base- 



Epoch 
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Eigure 2: MAE and RMSE in terms of different rank d on the 
Epinions dataset. 










































Table 4: MAE Results on Slashdot Dataset 


Observed entries (%) 

Methods 

SVT 

OPTSpace 

SVP 

RRMC 

MMC 

10 

0.679±0.008 

0.554±0.017 

0.755±0.005 

0.715±0.001 

0.546±0.008 

20 

0.562±0.004 

0.458±0.008 

0.582±0.007 

0.704±0.001 

0.437±0.008 

30 

0.513±0.030 

0.427±0.009 

0.501±0.003 

0.686±0.003 

0.395±0.006 

40 

0.506±0.041 

0.395±0.009 

0.460±0.002 

0.647±0.006 

0.380±0.009 

50 

0.495±0.060 

0.376±0.004 

0.432±0.002 

0.609±0.002 

0.366±0.003 

60 

0.520±0.065 

0.362±0.011 

0.413±0.002 

0.585±0.002 

0.350±0.002 


Table 5: RMSE Results on Slashdot Dataset 


Observed entries (%) 

Methods 

SVT 

OPTSpace 

SVP 

RRMC 

MMC 

10 

0.788±0.008 

0.826±0.021 

0.873±0.004 

0.829±0.001 

0.774±0.008 

20 

0.718±0.020 

0.704±0.013 

0.746±0.007 

0.821±0.001 

0.679±0.009 

30 

0.680±0.043 

0.652±0.006 

0.680±0.003 

0.807±0.002 

0.633±0.008 

40 

0.670±0.056 

0.620±0.009 

0.647±0.003 

0.778±0.005 

0.615±0.011 

50 

0.642±0.055 

0.596±0.009 

0.624±0.003 

0.749±0.002 

0.581±0.006 

60 

0.699±0.084 

0.577 ±0.011 

0.609±0.002 

0.730±0.002 

0.566±0.002 


Table 6: The trade-off between accuracy and efficiency. 


Dataset 

SVT 

OPTSpace 

SVP 

RRMC 

MMC 

MAE 

RMSE 

Time 

MAE 

RMSE 

Time 

MAE 

RMSE 

Time 

MAE 

RMSE 

Time 

MAE 

RMSE 

Time 

Epinions 

0.466 

0.618 

26.76 

0.288 

0.531 

15.26 

0.450 

0.610 

0.84 

0.576 

0.651 

43.11 

0.262 

0.481 

1.92 

Slashdot 

0.618 

0.728 

50.73 

0.458 

0.705 

9.23 

0.582 

0.746 

0.83 

0.715 

0.830 

44.12 

0.437 

0.679 

2.41 


lines. Also, our algorithm is orders of magnitude faster than 
other three baselines. This implies that MMC favors a good 
trade-off between the accuracy and efficiency. 

5.4 Examine The Influence of d 

The non-convex reformulation (I4.21 l requires an explicit rank 
estimation d on the true matrix. In this section, we investigate 
the influence of d on the Epinions dataset as an example. The 
rank d is chosen from [1, 5, 50 ,100, 300, 500] and the results 
are plot in Figure |2l We observe that the rank has little in¬ 
fluence on the performance. This is possibly because that the 
actual data has a low-rank structure (close to rank one). And 
from | |Burer and Monteiro, 2005| , we know that if d is larger 
than the actual rank, any local minimum of Eq. ( 14.2l i is also a 
global optima. 

6 Conclusion and Future Work 

In this paper, we formulated the social trust prediction in the 
matrix completion framework. In particular, due to the spe¬ 
cial structure of the social trust problem, i.e., the measure¬ 
ments are 1-bit and the observed entries are non-uniformly 
sampled, we presented a max-norm constrained 1-bit matrix 
completion (MMC) algorithm. Since SDP solvers are not 
scalable to large scale matrices, we utilized a non-convex re¬ 
formulation of the max-norm, which facilitates an efficient 
projected gradient decent algorithm. We empirically exam¬ 
ined our algorithm on two benchmark datasets. Compared to 
other state-of-the-art matrix completion formulations, MMC 


consistently outperformed them, which meets with recently 
developed theories on max-norm. We also studied the trade¬ 
off between the accuracy and efficiency and observed that 
MMC achieved superior accuracy while keeping comparable 
computational efficiency. 

The max-norm has been studied for several years and in 
many applications, such as collaborative filtering, clustering, 
subspace recovery. It is empirically and theoretically shown 
to be superior than the popular nuclear norm. This work in¬ 
vestigates the power of max-norm for social trust problem 
and demonstrates encouraging results. It is interesting and 
promising to apply max-norm as a convex surrogate to other 
practical problems such as face recognition, subspace cluster¬ 
ing etc. 
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